First, basic low-level character encoding differences can have a huge impact on the general searchability of data